Corpus: eng-dm_web_2015_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 99 99 99 99
1000 745 945 984 990 991
10000 4325 8100 9334 9716 9834
100000 4325 8101 9335 9717 9835
1000000 4325 8101 9335 9717 9835


Zipf's diagram for sentence endings


Gnuplot diagram

871 msec needed at 2018-04-13 12:53